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GenCore version 6.3 
Copyright (c) 1993 - 2009 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



May 31, 2009, 21:51:55 



Search time 1876 Seconds 
(without alignments) 

47444.276 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-573-229A-1 

920 

1 tctgtagaggggaatggctg acccccaaagaaaccttcta 920 



Scoring table: 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 83780570 seqs, 48372533981 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 10 0% 
Listing first 45 summaries 



Database 



EST : 



1 


gb_ 


_estl 




2 


gb_ 


_est2 




3 


gb_ 


_est3 




4 


gb_ 


_est4 




5 


gb_ 


_est5 




6 


gb_ 


_est6 




7 


gb_ 


_est7 




8 


gb_ 


_est8 
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9 : 


gb_est 9 : * 




10 


gb 


_estlO : 


* 


11 


gb_ 


_estll : 




12 


gb_ 


_estl2 : 




13 


gb_ 


_estl3 : 




14 


gb_ 


_estl4 : 


* 


15 


gb 


_estl5 : 




16 


gb 


_estl6 : 




17 


gb_ 


_estl7 : 


* 


18 


gb_ 


_estl8 : 




19 


gb_ 


_gssl : * 




20 


gb_ 


_gss2 : * 




21 


gb_ 


_gss3 : * 




22 


gb_ 


_gss4 : * 




23 


gb_ 


_gss5 : * 




24 


gb 


_gss6 : * 




25 


gb_ 


_gss7 : * 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


920 


100 


0 


920 


5 


BU183861 


BU183861 AGENCOURT 




2 


149 . 8 


16 


3 


988 


3 


BE733157 


BE733157 601567451 




3 


108 


11 


7 


533 


10 


DB059978 


DB059978 DB059978 


c 


4 


104.8 


11 


4 


478 


2 


AW269751 


AW269751 xv36gll.x 


c 


5 


104.8 


11 


4 


484 


2 


BE046932 


BE046932 hd92f06.x 




6 


104.4 


11 


3 


500 


10 


DB100179 


DB100179 DB100179 


c 


7 


103.2 


11 


2 


757 


20 


CC492937 


CC492937 CH240_327 




8 


101 . 4 


11 


0 


526 


10 


DB073121 


DB073121 DB073121 




9 


101 . 2 


11 


0 


669 


22 


EI735387 


EI735387 CHORI5142 


c 


10 


101.2 


11 


0 


702 


4 


BQ021722 


BQ021722 UI-H-DH1- 




11 


95 . 4 


10 


4 


549 


10 


DB074846 


DB074846 DB074846 


c 


12 


94.8 


10 


3 


446 


2 


AW297623 


AW297623 UI-H-BW0- 


c 


13 


94.6 


10 


3 


533 


1 


AA535137 


AA535137 nf87hl2.s 




14 


94.6 


10 


3 


637 


2 


AW970418 


AW970418 EST382499 


c 


15 


94 . 2 


10 


2 


429 


20 


CE539857 


CE539857 tigr-gss- 




16 


94 


10 


2 


707 


22 


EI399423 


EI399423 MUGQ_CH25 




17 


94 


10 


2 


805 


22 


ED482970 


ED4829 70 MUGQ_CH2 5 


c 


18 


92 . 8 


10 


1 


381 


25 


CU051182 


CU051182 Equus cab 


c 


19 


92 . 8 


10 


1 


466 


25 


CU288702 


CU288702 Equus cab 


c 


20 


92 . 4 


10 


0 


548 


10 


DB349144 


DB349144 DB349144 


c 


21 


91 . 8 


10 


0 


454 


4 


BM667926 


BM667926 UI-E-DW0- 




22 


91 . 8 


10 


0 


503 


4 


BM696584 


BM696584 UI-E-DW0- 
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23 


91 


8 


10 


0 


1147 


3 


BG119504 


BG119504 602349294 




24 


91 


4 


9 


9 


449 


1 


AA426503 


AA426503 zw02bl0.r 


c 


25 


91 


4 


9 


9 


682 


6 


CA431350 


CA431350 UI-H-FG1- 




26 


91 


4 


9 


9 


688 


5 


BX104512 


BX104512 BX104512 




27 


91 


4 


9 


9 


830 


3 


BG678891 


BG678891 602624794 


c 


28 




n 


9 


9 


703 


25 


CU177831 


CU177831 Equus cab 


c 


29 


90 


8 


9 


9 


284 


3 


BF591952 


BF591952 7o22d02.x 


c 


30 


90 


8 


9 


9 


365 


3 


BF476176 


BF476176 naa29c!02. 


c 


31 


90 


8 


9 


9 


421 


1 


AI767873 


AI767873 wi97hll.x 


c 


32 


90 


8 


9 


9 


444 


1 


AI218734 


AI218734 oo07b06.x 


c 


33 


90 


8 


9 


9 


448 


3 


BF516187 


BF516187 UI-H-BW1- 


c 


34 


90 


8 


9 


9 


483 


1 


AI697816 


AI697816 wel7f06.x 


c 


35 


90 


8 


9 


9 


493 


1 


AI670048 


AI670048 we64c!02.x 


c 


36 


90 


8 


9 


9 


508 


1 


AI668692 


AI668692 zb63all.x 


c 


37 


90 


8 


9 


9 


625 


1 


AI990414 


AI990414 wt74all.x 


c 


38 


90 


8 


9 


9 


629 


5 


BU730892 


BU730892 UI-E-CI1- 


c 


39 




39 


9 


7 


755 


24 


AG621567 


AG621567 Macaca fu 


c 


40 


88 


6 


9 


6 


834 


20 


CC062054 


CC06 205 4 MUGQ_CH2 5 




41 


88 


4 


9 


6 


496 


25 


CU026287 


CU026287 Equus cab 




42 


87 


6 


9 


5 


457 


10 


DB489743 


DB489743 DB489743 


c 


43 


87 


4 


9 


5 


349 


18 


M85416 


M85416 EST01931 Fe 




44 


87 


4 


9 


5 


760 


24 


AG6 7114 7 


AG671147 Macaca fu 


c 


45 


87 


2 


9 


5 


739 


25 


CT974461 


CT974461 Equus cab 



ALIGNMENTS 



RESULT 1 
BU183861 
LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BU183861 920 bp mRNA linear EST 04-SEP-2002 

AGENCOURT_7 96 4 945 NIH_MGC_72 Homo sapiens cDNA clone IMAGE : 6162433 
5 ' , mRNA sequence . 
BU183861 

BU183861.1 GI:22697845 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 

1 (bases 1 to 920) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: ATCC/DCTD/DTP 
cDNA Library Preparation: Life Technologies, Inc. 
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cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Agencourt Bioscience Corporation 

Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl . gov 

Plate: LLAM13516 row: f column: 02 
High quality sequence stop: 633. 
FEATURES Location/Qualifiers 
source 1. .920 

/organism="Homo sapiens" 

/mol_type=" mRNA " 

/db_xref="taxon: 96 06 " 

/clone=" IMAGE: 6162 433" 

/ 1 i s s ue_t ype= " me 1 ano t i c me 1 anoma " 

/lab_host="DH10B (phage-resistant ) " 

/clone_lib="NIH_MGC_72" 

/note="Organ : skin; Vector: pCMV-SPORT6 ; Site_l: Not I; 
Site_2: Sail; Cloned unidirect ionally . Primer: Oligo dT . 
Average insert size 2 kb . Library constructed by Life 
Technologies . " 



Query Match 100.0%; Score 920; DB 5; Length 920; 

Best Local Similarity 100.0%; Pred. No. 1.6e-249; 

Matches 920; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 TCTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTG 6 0 

Qy 61 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGAC 12 0 

Qy 121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCAC 180 

Qy 181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 AGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATT 2 40 

Qy 241 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GCTCCTTGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAA 300 

Qy 301 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GATCCCAGTAGGGCAGGAGACAGGAGCACCTCTGCTGTGGCCAATGCAGGAATGCTGGCC 36 0 
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Qy 
Db 
Qy 
Db 
Qy 
Db 
Qy 
Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 ATCATTGCTTCTGCTGGGCGACTGAGAAGCATCACCCACTTCCCCAGAACCTTTTTTACG 42 0 

421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 TGGAGTGAAAACTTTAAGGGGCTGTCCAGCTAAACCTCCAACCTCCAGATCCCATGCCAA 48 0 

481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 5 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 TTTCTCTGCTTCTGCAAAAGGACTTCAAGTGAAAGACATCTGCAGCTGTGAACGGGGGTA 540 



600 



5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 41 AAACCCTCCCTGCCCCAGGCCCCAAGCAAGGATTTCCCTAGCGGGGAGGAAGGTAGAATC 6 0 0 

6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 6 6 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6 01 GAGAGACCTCTAACCCTGGGAGAGGAGGGAGGGAAATCTCCGAGGACCAGGGTTATGCAA 6 6 0 

661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 CAACACAAGGGAAGTACCTGCTGGGTTCTGGGGGTTGGGGAAGGAAAATCCCTACTGCCC 72 0 

721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 CAAGAGCCAGCCCCGAACCCAAGGCACAGCTTATACTGGCCCCGGGGCCTGGGGGGGCAC 78 0 

781 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 GAAAACCTTGAAAAAGGGGCGCCTTCCCAGCTTCCCCGGGGGTAAGGGCTTTACCCCCCA 8 40 



841 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
841 GAGGGGGGGGGAAAAATCCGAGTGGGATCTTTCCCAACCGCCGAAGACTAAAACCTTTAA 

901 ACCCCCAAAGAAACCTTCTA 920 

I I I I I I I I I I I I I I I I I I I I 
901 ACCCCCAAAGAAACCTTCTA 920 



'00 



RESULT 2 
BE733157 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 



BE733157 988 bp mRNA linear EST 15-SEP- 

601567451F1 NIH_MGC_21 Homo sapiens cDNA clone IMAGE : 3842292 5' 
mRNA sequence . 
BE733157 

BE733157.1 GI:10147149 
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KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

Catarrhini; Hominidae; Homo. 

1 (bases 1 to 988) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Seguencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM535 row: e column: 13 
High guality seguence stop: 703. 
Location /Qualifiers 
1. .988 

/organism="Homo sapiens" 

/mol_type=" mRNA " 

/db_xref="taxon : 96 06 " 

/clone=" IMAGE: 3842292" 

/t is sue_type=" choriocarcinoma" 

/lab_host="DH10B (phage-resistant ) " 

/clone_lib="NIH_MGC_21" 

/note="0rgan : placenta; Vector: pOTB7; Site_l : Xhol; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Directionally cloned into EcoRI/XhoI sites using the 
following 5' adaptor: GGCACGAG (G) . Size-selected >500bp 
for average insert size 1.8kb. Library constructed by 
Ling Hong in the laboratory of Gerald M. Rubin (University 
of California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies)." 



Query Match 16 . 3 

Best Local Similarity 90.4 
Matches 160; Conservative 



Score 149.8; DB 3; 
Pred. No. 1.3e-3 0; 
0; Mismatches 17; 



Length 988; 



Indels 



0; Gaps 



QY 



12 7 ACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 ACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAA 



Qy 



3 7 CAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCT 2 46 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 61 CAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCT 12 0 



QY 



24 7 TGATTCTTAACCCACAGAAATTGTGTAAGACCTCCATCAGGTGTCGACAAGGAAGAT 3 03 

I I I I I I I I I I I I I I I I I I I I I I I I I II I III I I I I I I I II 

121 TGATTCTTAACCCACAGAAATTGTGCTTAACACCATGCAGAAGCTGCCAAGGCTTAT 177 



RESULT 3 
DB059978 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



DB059978 533 bp mRNA linear EST 18-JAN-2008 

DB059978 TESTI2 Homo sapiens cDNA clone TESTI2053144 5', mRNA 
sequence . 
DB059978 

DB059978.1 GI:83532935 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 533) 

Kimura,K., Wakamatsu, A . , Suzuki, Y . , Ota,T., Nishikawa, T . , 
Yamashita, R . , Yamamoto, J., Sekine,M., Tsuritani, K . , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M . , 
Murakawa,K., Ishida,S., Ishibashi, T . , Takahashi-Fu j ii, A . , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Email: flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 

Location /Qualifiers 

1. .533 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 9606" 
/clone="TEST!2 053144" 
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/ t is sue_type=" testis " 
/clone_lib="TESTl2" 
/note="Vector : pME18SFL3" 



Query Match 11.7%; Score 108; DB 10; Length 533; 

Best Local Similarity 67.5%; Pred. No. 8.3e-19; 

Matches 181; Conservative 0; Mismatches 85; Indels 2; Gaps 2; 

Qy 9 GGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGT 6 8 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 2 GGGAAGCAGCTACCATGTTGTGAGGCTGCTCAAGTAGCCTTGTGGAGAGGTCCACCTGGA 81 

Qy 6 9 GAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCAC 12 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 82 GAGAAAACAAGGCCTTTGCC-ACAGCCGGCACCAACTTGCCAACCATCTGAGTCAGCCAC 140 

Qy 12 9 TTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACA 188 

I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I II 
Db 141 CTTGGAAGTGGATCCTCCAGCCTCATTCCAGCCTTCAGATGACCACTGTCCCAGCTCATA 200 

Qy 18 9 ACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTG 2 48 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 201 TCTAGACTGCAACCTCATGAGAGACCTGGAGCCAGAA-CACCCAGCTAAGCTGTTCCTGA 25 9 

Qy 249 ATTCTTAACCCACAGAAATTGTGTAAGA 2 76 

I I I I II I I I I I I I I II I I I I 
Db 26 0 ATTCCCGACTCACAGAGACTATGAAAGA 28 7 



RESULT 4 

AW269751/C 

LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
COMMENT 



AW269751 478 bp mRNA linear EST 03-JAN-2000 

xv36gll.xl Soares_NFL_T_GBC_Sl Homo sapiens cDNA clone 
IMAGE : 2815268 3', mRNA sequence. 
AW269751 

AW269751.1 GI:6656781 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 478) 

NCI-CGAP http : //www . ncbi . nlm. nih . gov/ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
Tumor Gene Index 
Unpublished (1997) 

Contact: Robert Strausberg, Ph.D. 



http://es/ScoreAccessWeb/GetItem.action?AppId=105732...8_121059_us-10-573-229a-l.rst&ItemTy P e=4«& (8 of 26)6/15/2009 10:36:10 AM 



SCORE Search Results Details for Application 10573229 and Search Result 20090528_121059_us-10-573-229a-l.rst 



Email: cgapbs-r@mail.nih.gov 

This clone is available royalty-free through LLNL ; contact the 
IMAGE Consortium (info@image.llnl.gov) for further information. 
Seq primer: -40UP from Gibco 
High quality sequence stop: 446. 
FEATURES Location/Qualifiers 
source 1. .478 

/organism="Homo sapiens" 

/mo l_t y p e = " mRNA " 

/db_xref =" taxon : 9606 " 

/clone=" IMAGE: 2815268" 

/lab_host="DH10B" 

/clone_lib="Soares_NFL_T_GBC_Sl" 

/note="0rgan: pooled; Vector: pT7T3D-PacI ; Site_l : Not I; 
Site_2: Eco RI; Equal amounts of plasmid DNA from three 
normalized libraries (fetal lung NbHL19W, testis NHT, and 
B-cell NCI_CGAP_GCB1 ) were mixed, and ss circles were made 
in vitro. Following HAP purification, this DNA was used as 
tracer in a subtractive hybridization reaction. The driver 
was PCR-amplif ied cDNAs from pools of 5,000 clones made 
from the same 3 libraries. The pools consisted of 
I.M.A.G.E. clones 297480-302087, 682632-687239, 
726408-728711, and 729096-731399. Subtraction by Bento 
Soares and M. Fatima Bonaldo . " 



Query Match 11.4%; Score 104.8; DB 2; Length 478; 

Best Local Similarity 68.5%; Pred. No. 6.5e-18; 

Matches 174; Conservative 0; Mismatches 77; Indels 3 



Gaps 



Qy 



2 4 TGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACCGATGCCT 83 
I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
340 TTTCATGAGGATACTCAAGCATTCCTATGGAGAGATCCACATGGTGAGAAACTGAAGCCT 281 



Qy 
Db 
Qy 
Db 
Qy 
Db 
Qy 
Db 



280 



-CTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAGCTGATC 142 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
CCTACCAAGAGCCAGCACCAACTTGCCAGCTATGTGAATGAGCCATCTTAGAAGTGGGTT 2 21 



143 TTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACTGCAACC 2 02 

I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I 

22 0 CTCTAGCCCTAGTCAGGCCTTCATATGACTGCAGCCAGGGCTGATATTTTGACTACAACC 161 

2 03 TCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTGATTCTTAACCCACA 26 2 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
16 0 TCATGAGAGA — C T GAGC C AC AAC AAC C T AGC T AAGAAGC T C C T GAAT T C C C T AC C AAC A 103 

263 GAAATTGTGTAAGA 2 76 

I I I I I III III 
102 GAAAC TAT GT GAGA 89 
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RESULT 5 
BE046932/c 

LOCUS BE046932 484 bp mRNA linear EST 08-JUN-2000 

DEFINITION hc!92f06.x2 NCI_CGAP_GC6 Homo sapiens cDNA clone IMAGE : 2916995 3', 

mRNA sequence . 
ACCESSION BE046932 
VERSION BE046932.1 GI:8363985 

KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 484) 

AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., Michael 
R. Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL, send email to: 
inf o@image . llnl . gov 
Seq primer: -40UP from Gibco 
High quality sequence stop: 476. 
FEATURES Location/Qualifiers 
source 1. .484 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref="taxon: 96 06 " 
/clone=" IMAGE: 2916 995" 

/tissue_type="pooled germ cell tumors" 

/lab_host="DH10B" 

/clone_lib="NCI_CGAP_GC6" 

/note="Vector : pT7T3D-PacI; Site_l : Not I; Site_2 : Eco RI; 
Plasmid DNA from the normalized library NCI_CGAP_GC4 was 
prepared, and ss circles were made in vitro. Following HAP 
purification, this DNA was used as tracer in a subtractive 
hybridization reaction. The driver was PCR-amplif ied cDNAs 
from a pool of 5, 000 clones made from the same library 
(clonelDs 1257096-1258631, 1469064-1470983, and 
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1475592-1476743). Subtraction by Bento Soares and M. 
Fatima Bonaldo. " 



Query Match 11.4%; Score 104.8; DB 2; Length 484; 

Best Local Similarity 68.5%; Pred. No. 6.6e-18; 

Matches 174; Conservative 0; Mismatches 77; Indels 3; Gaps 2; 

Qy 2 4 TGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACCGATGCCT 83 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 33 8 TTTCATGAGGATACTCAAGCATTCCTATGGAGAGATCCACATGGTGAGAAACTGAAGCCT 2 79 

Qy 8 4 -CTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAGCTGATC 142 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II 
Db 2 78 CCTACCAAGAGCCAGCACCAACTTGCCAGCTATGTGAATGAGCCATCTTAGAAGTGGGTT 219 

Qy 143 TTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACTGCAACC 2 02 

I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I 

Db 218 CTCTAGCCCTAGTCAGGCCTTCATATGACTGCAGCCAGGGCTGATATTTTGACTACAACC 15 9 

Qy 203 TCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTGATTCTTAACCCACA 262 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 15 8 TCATGAGAGA — C T GAGC C AC AAC AAC C T AGC T AAGAAGC T C C T GAAT T C C C T AC C AAC A 101 

Qy 263 GAAATTGTGTAAGA 2 76 

I I I I I III III 
Db 10 0 GAAACTATGTGAGA 87 



RESULT 6 
DB100179 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



DB100179 500 bp mRNA linear EST 18-JAN-2008 

DB100179 TESTI4 Homo sapiens cDNA clone TESTI4052031 5', mRNA 
sequence . 
DB100179 

DB100179.1 GI:83431172 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 500) 

Kimura,K., Wakamatsu, A . , Suzuki , Y . , Ota,T., Nishikawa, T . , 
Yamashita, R . , Yamamoto, J., Sekine,M., Tsuritani, K . , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono, Y. , Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M . , 
Murakawa,K., Ishida,S., Ishibashi , T . , Takahashi-Fu j ii, A . , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
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TITLE Diversification of Transcriptional Modulation: Large-scale 

Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
JOURNAL Genome Res. 16 (1), 55-65 (2006) 
PUBMED 16344560 
COMMENT Contact: Takao Isogai 

FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari , Kisarazu, Chiba, 292-0818, Japan 
Email: flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .500 

/organism="Homo sapiens" 

/ mo l_t ype= " mRNA " 

/db_xref="taxon: 9606" 

/clone="TESTI4 052 031" 

/tissue_type="testis" 

/clone_lib="TESTI4" 

/note="Vector : pME18SFL3" 



Query Match 11.3%; Score 104.4; DB 10; 

Best Local Similarity 67.8%; Pred. No. 8.6e-18; 
Matches 175; Conservative 0; Mismatches 81; 



Length 5 00; 



Gaps 



Qy 



1 9 TGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACCGA 7 8 
II III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 TACCATGTTGTGAGGCTGCTCAAGTAGCCTTGTGGAGAGGTCCACCTGGAGAGAAAACAA 6 0 



Qy 
Db 
Qy 
Db 
Qy 
Db 
Qy 



7 9 TGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAGCT 13 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 1 GGCCTTTGCC-ACAGCCGGCACCAACTTGCCAACCATCTGAGTCAGCCACCTTGGAAGTG 119 

13 9 GATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACTGC 19 8 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 0 GATCCTCCAGCCTCATTCCAGCCTTCAGATGACCACTGTCCCAGCTCATATCTAGACTGC 179 

19 9 AACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTGATTCTTAACC 25 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
18 0 AACCTCATGAGAGACCTGGAGCCAGAA-CACCCAGCTAAGCTGTTCCTGAATTCCCGACT 23 8 

25 9 CACAGAAATTGTGTAAGA 2 76 
I I I I I I I I II I I I I 
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Db 23 9 CACAGAGACTATGAAAGA 25 6 



RESULT 7 

CC492937/C 

LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



Smailus,D., Prabhu,A.-L 
Olson, T., Mayo,M., 
Guin,R., Chan, A. 



Chiu,R. , 



CC492937 757 bp DNA linear GSS 17-JUN-2003 

CH240_327A8 .T7 CHORI-240 Bos taurus genomic clone CH240_327A8, 
genomic survey sequence. 
CC492937 

CC492937.1 GI:31803851 
GSS. 

Bos taurus (cattle) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; 
Pecora; Bovidae; Bovinae; Bos. 
1 (bases 1 to 757) 

Holt,R., Stott,J., Yang,G., Barber, S. 
Tsai,M., Cloutier,A., Lee,D., Girn,N., 
Butterf ield, Y . , Kirkpatrick, R . , Liu, J. 
Mathewson, C . , Wye,N., Masson,A., Brown- John, M . , Jones, S., 
Schein,J., Marra,M., de Jong, P . , McWilliam, S . , Barris,W., 
Dalrymple, B . P . and Tellam, R. 

Bovine BAC End Sequences from Library CHORI-240, PLATES 294 to 398 
Unpublished (2003) 

Other_GSSs : CH240_327A8 . TARBAC13P2 

Contact: Rob Holt 

Sequencing 

The British Columbia Cancer Agency Genome Science Centre 
600 W. 10th Ave, Vancouver, British Columbia, Canada V5Z 4E6 
Tel: 604-877-6085 
Fax: 604-877-6276 
Email: rholt @bcgsc . ca 

Clones are derived from the bovine BAC library CHORI-240 
(http://www.chori.org/bacpac/bovine240.htm). For BAC library 
availability, please contact Pieter de Jong (pdejong@mail.cho.org). 
Clones may be purchased from BACPAC Resources 

(http://www.chori.org/bacpac/ordering_information.htm) . This work 
was undertaken as part of the International Bovine BAC Mapping 
Consortium (IBBMC) by CSIRO Livestock Industries, Australia and the 
British Columbia Genome Sciences Centre, Canada. 
Plate: 327 row: A column: 8 
Seq primer: T7 
Class: BAC ends. 

Location /Qualifiers 
1. .757 

/organism="Bos taurus" 
/mol_type=" genomic DNA" 
/strain="breed : Hereford" 
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/db_xref="taxon: 9913" 

/clone="CH2 4 0_32 7A8" 

/sex="Male" 

/ eel l_type=" Blood" 

/clone_lib="CHORI-240" 

/note="Vector : pTARBAC1.3; Site_l : Mbol; Site_2: Mbol; 
Hereford bull LI Domino 99375; CHORI-240 Bovine BAC 
library (Male) produced by Pieter de Jong" 



Query Match 11.2%; Score 103.2; DB 20; Length 757; 

Best Local Similarity 64.1%; Pred. No. 2.1e-17; 

Matches 173; Conservative 0; Mismatches 93; Indels 4; Gaps 1; 

Qy 2 CTGTAGAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGC 61 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 612 CTCTTGGGGAAGTTGGCTTGCATTCTATGAGGACATTCACATGGCCCTATGGAGATATCT 553 

Qy 6 2 ACTTGGTGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACT 121 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 55 2 GTGTGGTGAGGAACTGAGGCTGTTGACAACACCCAGCACTATCTTTCTGGGTGTGTGAAT 4 93 

Qy 122 GAGCCAC TTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGC 177 

II II I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I 

Db 492 GAATGACCAAATTTGGAAGAAAATCTTCCAGCCTCAGTCAGGCCTTCAGGTGACTGTAGC 433 

Qy 178 CACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAA 23 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I III II I II I I I I 
Db 432 CGCAGCCAACATCTAGACTGCAACCTCCTGAGAGACTCTGAACCATAACCACCCAACTAA 3 73 

Qy 23 8 ATTGCTCCTTGATTCTTAACCCACAGAAAT 26 7 

III II I I I I I I I I I I III 
Db 372 GTTGTCCCCGAATTCCTGACCCATGAAAGT 343 



RESULT 8 
DB073121 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 



DB073121 526 bp mRNA linear EST 18-JAN-2008 

DB073121 TESTI4 Homo sapiens cDNA clone TESTI4016394 5', mRNA 
sequence . 
DB073121 

DB073121.1 GI:83564345 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 526) 
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AUTHORS Kimura,K., Wakamatsu, A . , Suzuki, Y . , 0ta,T., Nishikawa, T . , 

Yamashita, R . , Yamamoto, J . , Sekine,M., Tsuritani, K . , Wakaguri,H., 
Ishii,S., Sugiyama, T . , Saito,K., Isono, Y. , Irie,R., Kushida,N., 
Yoneyama,T., 0tsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M . , 
Murakawa,K., Ishida,S., Ishibashi , T . , Takahashi-Fu j ii, A . , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 

TITLE Diversification of Transcriptional Modulation: Large-scale 

Identification and Characterization of Putative Alternative 
Promoters of Human Genes 

JOURNAL Genome Res. 16 (1), 55-65 (2006) 
PUBMED 16344560 
COMMENT Contact: Takao Isogai 

FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Email: flj-cdna@nifty.com 

NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .526 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="t axon: 96 06" 

/clone="TESTI4 0163 94" 

/tissue_type="testis" 

/clone_lib="TESTI4" 

/note="Vector : pME18SFL3" 



Query Match 11 . 0%; 

Best Local Similarity 65.7%; 
Matches 178; Conservative 



Score 101.4; DB 10; Length 526; 
Pred. No. 6.2e-17; 
0; Mismatches 91; Indels 2; Gaps 



QY 



7 GAGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTG 6 6 
I I I I I I I I I I I I I I I I III I I I I I I I I I I I II 

1 0 GGGGGAAGTTGCTATGTTGTGAGGGCACTCATGCAGCCCATCAATGGTGAGATTCATGTG 6 9 



Qy 

Db 

Qy 
Db 



6 7 GTGAGAAACCGATGCCT-CTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGC 125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7 0 CCAAGGAACTGAGGCCTCCTGCCAATAACCAGCACTAACTTGCCAGACATATGAGTGAGC 12 9 

12 6 CACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCA 185 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

13 0 TACCTCAGAAGCAAATCCTCCAGCCCCAGTCAAGCCTTCAGATGATTACAGCCCTGGCTA 18 9 
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Qy 186 ACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCC 2 45 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 19 0 ATATCTTGACTG-AACTTCATGTGAGACCTTGAGCCAGAATCATCTAGCTAAGGCATTCC 2 48 

Qy 2 46 TTGATTCTTAACCCACAGAAATTGTGTAAGA 2 76 

I I I I I I I I I I I I I I MM Ml III 
Db 249 TGGATTCCTGCCCCACAGCAATTATGTGAGA 2 79 



RESULT 9 
EI735387 

LOCUS EI735387 669 bp DNA linear GSS 28-OCT-2007 

DEFINITION CHORI51421012TR BAC library from the primary breast tumor sample 

sl04 Homo sapiens genomic clone CHORI514_21_012, genomic survey 

sequence . 
ACCESSION EI735387 

VERSION EI735387.1 GI:158756676 

KEYWORDS GSS. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchont oglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 669) 

AUTHORS Raphael, B. J. , Volik,S.V., Yu,P., Wu,C, Huang, G.Q., Waldman,F., 
Costello, J . , Pienta,K., Mills, G., Ba j sarowicz , K . , Kobayashi, Y . , 
Sridharan, S . , Paris, P., Tao,Q.Z., Gray, J. W., Cheng, J. F., de 
Jong, P., Nefedov,M., Aerni,S., Brown, R. P., Bashir,A., 
Padilla-Nash, H . M . and Collins, C.C. 
TITLE A sequence-based survey of the complex structural organization of 

tumor genomes 
JOURNAL Unpublished (2007) 
COMMENT Contact: Volik SV 

Colin Collins' lab 
UCSF Department of Urology 

UCSF Box 0808, San Francisco, CA 94143-0808, USA 

Tel: 415 502 7067 

Fax: 415 476 8218 

Email: svolik@cc.ucsf.edu 

Seq primer: KBR/TJ 5 ' CTGGCCGTCGACATTTAGG-3 ' . 
FEATURES Location/Qualifiers 
source 1. .669 

/organism="Homo sapiens" 
/mol_type=" genomic DNA" 
/db_xref="t axon: 96 06" 
/clone="CHORI514_21_012" 
/tissue_type="Primary tumor" 

/clone_lib="BAC library from the primary breast tumor 
sample sl04" 
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/note="Organ : Breast; Vector: pBACGKl.l; Site_l: Hindlll; 
This library was constructed from sl04 primary breast 
tumor sample by Dr. M. Nefedov (Dr. Peter de Jong's 
laboratory) . The patient did not receive chemo- or 
radiotherapy and did not have recurrence for 10 years 
after surgery . " 



Query Match 11.0%; Score 101.2; DB 22; Length 669; 

Best Local Similarity 62.8%; Pred. No. 7.6e-17; 

Matches 15 7; Conservative 0; Mismatches 93; Indels 0; Gaps 0; 

Qy 18 CTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACCG 7 7 

I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 13 0 CTGCCATGTCGTGAGAACACACAAGCAGCCCCAAGGAGGGGTCCGTGTGGTAAGGAACTG 189 

Qy 78 ATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAGC 13 7 

I I I I I I I I II I I I I III I I I I I I I I I I I I I I I 

Db 19 0 AGACCCCTGCTGGCAGCCAACACCACTTTGCCAACATTGTGAGTGACCCATCTTGGACAT 2 49 

Qy 13 8 TGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACTG 19 7 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 25 0 AGGTTCCTGACATCCTGTGAAGCCTTCAGCTGACTGCAGCCCTGTCTGACATCTTGACTG 309 

Qy 19 8 CAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTGATTCTTAAC 25 7 

I I I I I I I I I I I I I I I III II I I I I I I I I I I I I I I I I 

Db 310 CAACCTCATGAGAGATCCCAAACAAGAACTACCCAGCTAAGCCATTCCCTGATTCCTGAT 36 9 



Qy 25 8 CCACAGAAAT 26 7 

I I I I I I I I I I 
Db 3 70 CCACAGAAAT 3 79 



RESULT 10 

BQ021722/c 

LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 
TITLE 



BQ021722 702 bp mRNA linear EST 27-MAR-2002 

UI-H-DHl-axi-n-10-0-UI . si NCI_CGAP_DH1 Homo sapiens cDNA clone 
IMAGE : 5829321 3', mRNA sequence. 
BQ021722 

BQ021722.1 GI:19757000 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 702) 

NCI-CGAP http : / / www . ncbi . nlm. nih . gov/ ncicgap . 

National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 
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Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Jose Mercuende 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Clone distribution information can be found 

through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
The following repetitive elements were found in this cDNA 

sequence: 16-517, >MLTlF#LTR/MaLR (matched compliment) 

Seq primer: M13 FORWARD 

POLYA=Yes . 

FEATURES Location/Qualifiers 
source 1 . .702 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/ db_xref =" taxon : 96 06 " 
/clone=" IMAGE: 5 82 9321" 

/ t is sue_type= "Met astatic Chondrosarcoma" 
/dev_stage= "Adult " 

/lab_host="DH10B (Life Technologies)" 
/ c lone_l ib= " NC I_CGAP_DH1 " 

/note="0rgan : Lung; Vector: pT7T3-Pac (Pharmacia) with a 
modified polylinker; Site_l : EcoR I; Site_2 : Not I; 
NCI_CGAP_DH1 is a normalized cDNA library containing the 
following tissue(s): VS-8 Cell line from Metastatic 
Chondrosarcoma in Lung. The library was constructed 
according to Bonaldo, Lennon and Soares, Genome Research, 
6:791-806, 1996. First strand cDNA synthesis was primed 
with an oligo-dT primer containing a Not I site. Double 
stranded cDNA was ligated to an EcoR I adaptor, digested 
with Not I, and cloned direct ionally into pT7T3-Pac 
vector. The oligonucleotide used to prime the synthesis o 
first-strand cDNA contains a library tag sequence that is 
located between the Not I site and the (dT)18 tail. The 
sequence tag for this library is AGATCATTGC . 
TAG_TISSUE=lung 
TAG_LIB=UI-H-DH1 
TAG_SEQ=AGATCATTGC " 

ORIGIN 

Query Match 11.0%; Score 101.2; DB 4; Length 702; 

Best Local Similarity 62.8%; Pred. No. 7.7e-17; 

Matches 15 7; Conservative 0; Mismatches 93; Indels 0; Gaps 0 

Qy 18 CTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACCG 7 7 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 341 CTGCCATGTCGTGAGAACACACAAGCAGCCCCAAGGAGGGGTCCGTGTGGTAAGGAACTG 2 82 



Qy 78 ATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAGC 13 7 

I I I I I I I I II I I I I III I I I I I I I I I I I I I I I 

Db 281 AGACCCCTGCTGGCAGCCAACACCACTTTGCCAACATTGTGAGTGACCCATCTTGGACAT 222 

Qy 13 8 TGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACTG 19 7 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 221 AGGTTCCTGACATCCTGTGAAGCCTTCAGCTGACTGCAGCCCTGTCTGACATCTTGACTG 162 

Qy 19 8 CAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTGATTCTTAAC 25 7 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 161 CAACCTCATGAGAGATCCCAAACAAGAACTACCCAGCTAAGCCATTCCCTGATTCCTGAT 102 

Qy 25 8 CCACAGAAAT 26 7 

I I I I I I I I I I 
Db 101 CCACAGAAAT 92 



RESULT 11 

DB074846 

LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 



JOURNAL 
PUBMED 
COMMENT 



DB074846 549 bp mRNA linear EST 18-JAN-2008 

DB074846 TESTI4 Homo sapiens cDNA clone TESTI4018555 5', mRNA 
sequence . 
DB074846 

DB074846.1 GI:83099624 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchont oglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 549) 

Kimura,K., Wakamatsu, A . , Suzuki, Y., Ota,T., Nishikawa, T . , 
Yamashita, R . , Yamamoto, J., Sekine,M., Tsuritani, K . , Wakaguri,H., 
Ishii,S., Sugiyama,T., Saito,K., Isono, Y. , Irie,R., Kushida,N., 
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma, M . , 
Murakawa,K., Ishida,S., Ishibashi, T . , Takahashi-Fu j ii, A . , 
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S. 
Diversification of Transcriptional Modulation: Large-scale 
Identification and Characterization of Putative Alternative 
Promoters of Human Genes 
Genome Res. 16 (1), 55-65 (2006) 
16344560 

Contact: Takao Isogai 
FLJ Project (HRI Team) 
Helix Research Institute 

2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan 
Email: flj-cdna@nifty.com 
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NEDO human cDNA project (New Energy and Industrial Technology 
Developmental Organization, Japan); cDNA library construction: 
Helix Research Institute (HRI); 5 ' -end one pass sequencing: HRI, 
Research Association for Biotechnology (RAB) and Biotechnology 
Center, National Institute of Technology and Evaluation; 3 ' -end one 
pass sequencing: RAB. 
FEATURES Location/Qualifiers 
source 1. .549 

/organism="Homo sapiens" 

/mol_type=" mRNA " 

/db_xref="taxon: 96 06" 

/clone="TESTI40 18555" 

/tissue_type="testis" 

/clone_lib="TESTI4" 

/note="Vector : pME18SFL3" 



Query Match 10.4%; Score 95.4; DB 10; Length 549; 

Best Local Similarity 63.1%; Pred. No. 3.2e-15; 

Matches 171; Conservative 0; Mismatches 86; Indels 14; Gaps 1; 

Qy 17 GCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACC 76 

I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 7 GCTGCCCTGTCATGACGGTATGCATGCAGCCCCATGGAGAGGCCCACATCATGAGGAACT 6 6 

Qy 7 7 GATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAG 136 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 6 7 AA CCAAGCACTAACTTGCCTGGTGTTTCAATAGGCCACTTTGGAGG 112 

Qy 13 7 CTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACT 196 

I I I I I III I I I I I I I I I I I I I I I I I I I I II I II I I I I 

Db 113 CAGATTCTCCAGCCTTGGTCAGGCCTTCAGATGACTGCAGCCCTGGCTGATGACTTGACT 172 

Qy 19 7 GCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCTCCTTGATTCTTAA 256 

II I I I I I II I II I I I I I I I ! I I I I II II II I I I I I I I I I I I I 

Db 173 GCCACCTCATGAGAGACCTTGAGCCAGAACCACCCAGCCAAGCCACTCCTGGATTCCTGA 232 

Qy 25 7 CCCACAGAAATTGTGTAAGACCTCCATCAGG 28 7 

I I I I I I I I I I I I I I I I I I II 
Db 233 TCCACAGAAACTAGGTGAAACGTCAAGAATG 263 



RESULT 12 

AW297623/C 

LOCUS 

DEFINITION 



AW297623 446 bp mRNA linear EST 16-JAN-2000 

UI-H-BW0-ajg-f-08-0-UI . si NCI_CGAP_Sub6 Homo sapiens cDNA clone 
IMAGE =2 731910 3', mRNA sequence. 

AW297623 

AW297623.1 GI:6704259 
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KEYWORDS EST. 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 446) 

AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

The seguence contained an oligo-dT track that was present in the 
oligonucleotide that was used to prime the synthesis of first 
strand cDNA and therefore this may represent a bonafide poly A 
tail. cDNA Library Preparation: M.B. Soares Lab Clone distribution: 
NCI-CGAP clone distribution information can be found through the 
I.M.A.G.E. Consortium/LLNL at: 

www-bio.llnl.gov/bbrp/image/image.html The following repetitive 
elements were found in this cDNA seguence: 49-424, >MLTlF#LTR/MaLR 
Seg primer: M13 Forward 
POLYA=Yes . 

FEATURES Location/Qualifiers 
source 1. .446 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 2 731910" 

/lab_host="DH10B (Life Technologies)" 
/ c lone_l ib= " NC I_CGAP_Sub6 " 

/note="Vector : pT7T3D-PacI; Site_l : Not I; Site_2 : Eco RI; 
NCI_CGAP_Sub6 is a subtracted library derived from BW, 
which consists of a mixture of four normalized libraries: 
NCI_CGAP_Brn5 0, NCI_CGAP_Lul3 , NCI_CGAP_0vl8 , GBC1 . The 
NCI_CGAP_Sub6 library had 7 million recombinants. A 
single-stranded DNA preparation of BW was used as a tracer 
in a subtractive hybridization with a driver comprising: 
the IMAGE pool (NCI_CGAP_Kid3 pool 1 LLAM 3334-3337, 
3682-3683, 3798-3803 (IMAGE ClonelDs 1322376-1323911, 
1456008-1456775,1500552-1502855); NCI_CGAP_Kid5 pool 1 
LLAM 3338-3342,3722-3725, 3776-3778 (IMAGE ClonelDs 
1323912-1325831, 1471368-1472903, 1492104-1493255); 
NCI_CGAP_Lu5 pool 1 LLAM 3575-3582, 3851-3854 (IMAGE 
ClonelDs 1414920-1417991, 1520904-1522439); NCI_CGAP_GC4 
pool 1 LLAM 3164-3167, 3716-3720, 3733-3735 (IMAGE 
ClonelDs 125 70 96-1258631, 146 9 06 4-1470 983, 
1475592-1476743); NCI_CGAP_Pr22 pool 1 LLAM 2457-2459, 
2758-2759, 3062-3068 (IMAGE ClonelDs 985608-986759, 
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1101192-1101959, 1217928-1220615); NCI_CGAP_ColO pool 1 
LLAM 2644-2653, 2871-2872 (IMAGE ClonelDs 
1057416-1061255,1144584-1145351). (50% of the driver 
population), plus a pool of 3,840 arrayed clones from 
NCI_CGAP_Subl (IMAGE ClonelDs 2708616-2710535) and 
NCI_CGAP_Sub2 (IMAGE ClonelDs 2710536-2712455) (20% of 

the driver population), plus a pool of 11,136 clones from 
NCI_CGAP_Sub3 (IMAGE ClonelDs 2712456-2723591) (30% of 
the driver population) . Subtraction was performed as 
previously described [Bonaldo, Lennon & Soares (1996): 
Normalization and Subtraction: Two Approaches To 
Facilitate Gene Discovery. Genome Research 6, 791-806. 
TAG_TISSUE=B cells germinal 
TAG_LIB=Gbcl 
TAG_SEQ=TCA" 



Query Match 10 . 3%; 

Best Local Similarity 67.1%; 
Matches 149; Conservative 



Score 94.8; DB 2; 
Pred. No. 4.5e-15; 
0; Mismatches 72; 



Length 446; 



Indels 



Gaps 



1; 



Qy 



4 7 CCAGTGGAGAGGTGCACTTGGTGAGAAACCGATG-CCTCTGCCAACCACCTGCACTAACC 105 
II I II I II I II I I I I I II I I I I I I I I I I I I I I I I I III 
32 8 CCTATGGAGATGCCCATGTGGGGAAGAATGGAGGTCCCCTGCCAACAATCAGCTTTGACA 26 9 



Qy 

Db 

Qy 

Db 

Qy 
Db 



106 TGCTGGGTCTGAGACTGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTA 16 5 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

26 8 TGCCAAGCCTGTGAGGGAGCCATCTGGAAAGGGAGTCTTCCAGCCCCAATCAGGCCTTCA 209 

16 6 GCTGGCTGCAGCCACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAA 225 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20 8 GATGACTGCAGCCCTGGCCAGCTCCTTGACTGCAACCTCATGACAGACCTTGAGCAAGAA 149 

22 6 TCCCCTGGCTAAATTGCTCCTTGATTCTTAACCCACAGAAAT 26 7 

I I I I I I I I I I II I I I I I I I I II I I I I I I I I 
148 C T GC T C AGC T AAGC T GC T T C T GAAT TC T T AAC C T AC AGAAAT 107 



RESULT 13 

AA535137/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



AA535137 

nf87hl2.sl NCI_CGAP_Co3 
mRNA seguence . 
AA535137 

AA535137.1 GI:2279390 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata 



533 bp mRNA linear EST 21-AUG-1997 
Homo sapiens cDNA clone IMAGE: 926951 3', 



Craniata; Vertebrata; Euteleostomi ; 
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Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 533) 

AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap. 

TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP), 

Tumor Gene Index 
JOURNAL Unpublished (1997) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Elias Campo, M.D., Michael R. Emmert-Buck, 
M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D. 
cDNA Library Arraying: Greg Lennon, Ph.D. 

DNA Seguencing by: Washington University Genome Seguencing Center 
Clone distribution: NCI-CGAP clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
www-bio . llnl . gov/bbrp/ image/ image . html 
Insert Length: 826 Std Error: 0.00 
Seg primer: -40ml3 fwd. ET from Amersham 
High guality seguence stop: 440. 
FEATURES Location/Qualifiers 
source 1. .533 

/organism="Homo sapiens" 

/mol_type=" mRNA " 

/db_xref="taxon : 96 06 " 

/clone=" IMAGE: 926 951" 

/sex="pooled" 

/ t is sue_type=" colon" 

/lab_host="DH10B" 

/clone_lib="NCI_CGAP_Co3" 

/note="Vector : pT7T3D-PacI; Site_l : Not I; Site_2 : Eco RI; 
1st strand cDNA was prepared from 12 pooled bulk tumor 
samples and primed with a Not I - oligo(dT) primer. 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library 
went through one round of normalization. " 



Query Match 10 . 3%; 

Best Local Similarity 62.3%; 
Matches 175; Conservative 



Score 94.6; DB 1; 
Pred. No. 5.3e-15; 
0; Mismatches 89; 



Length 533; 



Indels 



17; Gaps 



QY 



8 AGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGG 6 7 
I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I 

4 7 8 AGGGAGAAGTCAGCTGTCATGCGAGGGCACTCAAGCAGCCCTGTGGAAAGCTCTACGTGG 419 



Qy 



6 8 TGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCT GCTG 110 

I I I I I I I I I I I I I I I I ! MM M IM M 
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Db 418 T GAGAAAC T GAAGC C T C C T GC C AAC AC C AAC AAGGAAC T GAGAC C T AC T T C C AAC AGC C A 35 9 

Qy 111 GGTCTGAGACTGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGG 170 

I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 35 8 TCTGAGTGATCGATCCATCCTGGAATCAGATCCTCCAGCCCCAGTCAAGTCTTCAGATGA 299 

Qy 171 CTGCAGCCACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCC 23 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 298 CTGCAGCCCTGACCAGCATCTCAACTGCAACCTCGTGAATGACCCTGAGCCAGAACCATG 239 

Qy 231 TGGCTAAATTGCTCCTTGATTCTTAACCCACAGAAATTGTG 2 71 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 23 8 CAACTAAACTCTGCGTGGATTCATGACCCACAGAAACTGTG 198 



RESULT 14 
AW9 70418 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



JOURNAL 
COMMENT 



FEATURES 

source 



AW970418 637 bp mRNA linear EST 01-JUN-2000 

EST382499 MAGE resequences, MAGK Homo sapiens cDNA, mRNA sequence. 
AW970418 

AW970418.1 GI:8160263 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
1 (bases 1 to 637) 

Hegde,P., Qi, R . , Abernathy , K . , Dharap, S . , Gaspard,R., Gay,C, 
Holt, I.E., Saeed,A.I., Sharov,V., Lee, N . H . , Yeatman,T.J. and 
Quackenbush, J . 

Assessment of gene expression patterns in a model of colon tumor 

metastasis using a 19,200 element cDNA microarray 

Unpublished (2000) 

Contact: John Quackenbush 

The Institute for Genomic Research 

9712 Medical Center Dr., Rockville, MD 20850, USA 

Tel: 301 838 3528 

Fax: 301 838 0208 

Email: johnq@tigr.org 

Plate: 277 

Seq primer: Forward. 

Location /Qualifiers 

1. .637 

/organism="Homo sapiens" 
/mol_type=" mRNA " 
/db_xref="t axon: 96 06" 
/clone_lib="MAGE resequences, MAGK" 
/note="Vector : pBluescript SKm" 
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Query Match 10.3%; Score 94.6; DB 2; Length 63 7; 

Best Local Similarity 62.3%; Pred. No. 5.6e-15; 

Matches 175; Conservative 0; Mismatches 89; Indels 17; Gaps 1; 

Qy 8 AGGGGAATGGCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGG 6 7 

I I I I I I I I I I I I III! I I I I I I I I I I I I I I I I I I I I I I 

Db 2 74 AGGGAGAAGTCAGCTGTCATGCGAGGGCACTCAAGCAGCCCTGTGGAAAGCTCTACGTGG 333 

Qy 6 8 TGAGAAACCGATGCCTCTGCCAACCACCTGCACTAACCT GCTG 110 

I I I I I I I I I I I I I I I I I I I I I II III II 
Db 33 4 T GAGAAAC T GAAGC C T C C T GC C AAC AC C AAC AAGGAAC T GAGAC C T AC T T C C AAC AGC C A 3 93 

Qy 111 GGTCTGAGACTGAGCCACTTTGGAAGCTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGG 170 

I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 394 TCTGAGTGATCGATCCATCCTGGAATCAGATCCTCCAGCCCCAGTCAAGTCTTCAGATGA 453 

Qy 171 CTGCAGCCACAGCCAACAACAAGACTGCAACCTCCTGGGGGATCCTGAGCCAGAATCCCC 23 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 45 4 CTGCAGCCCTGACCAGCATCTCAACTGCAACCTCGTGAATGACCCTGAGCCAGAACCATG 513 

Qy 231 TGGCTAAATTGCTCCTTGATTCTTAACCCACAGAAATTGTG 271 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 514 CAACTAAACTTTGCGTGGATTCATGACCCACAGAAACTGTG 55 4 



RESULT 15 

CE539857/C 

LOCUS 

DEFINITION 



REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
PUBMED 
COMMENT 



CE539857 429 bp DNA linear GSS 28-SEP-2003 

tigr-gss-dog-17000366002738 Dog Library Canis lupus familiaris 
genomic, genomic survey sequence. 
CE539857 

CE539857.1 GI:36856638 
GSS. 

Canis lupus familiaris (dog) 
Canis lupus familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Laurasiather ia; Carnivora; Caniformia; Canidae; 
Canis . 

1 (bases 1 to 429) 

Kirkness , E . F . , Bafna,V., Halpern, A . L . , Levy,S., Remington, K . , 
Rusch,D.B., Delcher , A . L . , Pop,M., Wang,W., Fraser,C.M. and 
Venter, J. C. 

The dog genome: survey sequencing and comparative analysis 

Science 301 (5641), 1898-1903 (2003) 

14512627 

Contact: Kirkness EF 

The Institute for Genomic Research 
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Department of Eukaryotic Genomics, TIGR, 9712 Medical Center Drive, 
Rockville, MD 20850, USA 
Tel: 301-838-0200 
Fax: 301-838-0208 
Email: ekirknes@tigr.org 
Class: shotgun. 
FEATURES Location/Qualifiers 
source 1. .429 

/organism="Canis lupus familiaris" 

/mol_type=" genomic DNA" 

/strain="Standard Poodle" 

/ sub_species=" familiaris " 

/db_xref ="taxon : 9615" 

/clone_lib="Dog Library" 

/note=" Site_l : BstXI; Libraries were prepared from 
peripheral blood" 



Query Match 10.2%; Score 94.2; DB 20; Length 429; 

Best Local Similarity 63.4%; Pred. No. 6.5e-15; 

Matches 144; Conservative 0; Mismatches 83; Indels 0; Gaps 0; 

Qy 17 GCTGCTGTGTCATGGGGGTGCATGAGCAGCCCAGTGGAGAGGTGCACTTGGTGAGAAACC 76 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 312 GCTGCCATGTCAATAGGATCCTCAAAGACTCTTGTGGAGAAGTTCACATGGTAAGAAACT 253 

Qy 7 7 GATGCCTCTGCCAACCACCTGCACTAACCTGCTGGGTCTGAGACTGAGCCACTTTGGAAG 136 

I I I I I I I I I II I I I I III I I I I I I I I I I I I I I I I 

Db 25 2 GAGGCTTAATGCTAACACCAGCATCAGTTTGCCAACACTGGGAGTGAGCTCTCTTGGAAA 193 

Qy 13 7 CTGATCTTGGAGCACCAGTCAAGCCCTTAGCTGGCTGCAGCCACAGCCAACAACAAGACT 196 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 192 TGAATCCTTTAGGCCAAGTCAAGCCTTTGGATAACCATAGCCCTGGCCCACATCTTGACT 133 

Qy 19 7 GCAACCTCCTGGGGGATCCTGAGCCAGAATCCCCTGGCTAAATTGCT 2 43 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 132 GC AAC T T CAT GAGA GAC C C TGAGC C AGAAC C AC C C AGC AAAGC T GC T 86 



Search completed: May 31, 
Job time : 1892 sees 



2009, 22:23:27 
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